Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 132241 |
| Missing cells | 615435 |
| Missing cells (%) | 17.2% |
| Duplicate rows | 5864 |
| Duplicate rows (%) | 4.4% |
| Total size in memory | 32.3 MiB |
| Average record size in memory | 256.0 B |
Variable types
| Categorical | 16 |
|---|---|
| Numeric | 11 |
| Dataset has 5864 (4.4%) duplicate rows | Duplicates |
EXTENSION_COUNT is highly imbalanced (55.6%) | Imbalance |
WIND_TURBINE_COUNT is highly imbalanced (99.2%) | Imbalance |
SOLAR_WATER_HEATING_FLAG is highly imbalanced (97.7%) | Imbalance |
BUILT_FORM has 5746 (4.3%) missing values | Missing |
MAINS_GAS_FLAG has 18965 (14.3%) missing values | Missing |
FLAT_TOP_STOREY has 60385 (45.7%) missing values | Missing |
FLAT_STOREY_COUNT has 118657 (89.7%) missing values | Missing |
MULTI_GLAZE_PROPORTION has 13523 (10.2%) missing values | Missing |
EXTENSION_COUNT has 18714 (14.2%) missing values | Missing |
NUMBER_HABITABLE_ROOMS has 18714 (14.2%) missing values | Missing |
NUMBER_HEATED_ROOMS has 18714 (14.2%) missing values | Missing |
LOW_ENERGY_LIGHTING has 6277 (4.7%) missing values | Missing |
NUMBER_OPEN_FIREPLACES has 3055 (2.3%) missing values | Missing |
WIND_TURBINE_COUNT has 9729 (7.4%) missing values | Missing |
FLOOR_HEIGHT has 63153 (47.8%) missing values | Missing |
PHOTO_SUPPLY has 53380 (40.4%) missing values | Missing |
SOLAR_WATER_HEATING_FLAG has 44350 (33.5%) missing values | Missing |
CONSTRUCTION_AGE_BAND has 14876 (11.2%) missing values | Missing |
FIXED_LIGHTING_OUTLETS_COUNT has 57395 (43.4%) missing values | Missing |
LOW_ENERGY_FIXED_LIGHT_COUNT has 89802 (67.9%) missing values | Missing |
NUMBER_OPEN_FIREPLACES is highly skewed (γ1 = 67.18573375) | Skewed |
FLOOR_HEIGHT is highly skewed (γ1 = 41.26470328) | Skewed |
PHOTO_SUPPLY is highly skewed (γ1 = 29.64736707) | Skewed |
LOW_ENERGY_FIXED_LIGHT_COUNT is highly skewed (γ1 = 21.2043161) | Skewed |
MULTI_GLAZE_PROPORTION has 9411 (7.1%) zeros | Zeros |
LOW_ENERGY_LIGHTING has 19391 (14.7%) zeros | Zeros |
NUMBER_OPEN_FIREPLACES has 120047 (90.8%) zeros | Zeros |
PHOTO_SUPPLY has 78723 (59.5%) zeros | Zeros |
LOW_ENERGY_FIXED_LIGHT_COUNT has 6491 (4.9%) zeros | Zeros |
Reproduction
| Analysis started | 2024-02-25 23:28:06.665648 |
|---|---|
| Analysis finished | 2024-02-25 23:28:42.433397 |
| Duration | 35.77 seconds |
| Software version | ydata-profiling vv4.6.4 |
| Download configuration | config.json |
CURRENT_ENERGY_RATING
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| D | |
|---|---|
| C | |
| E | |
| B | |
| F | 3513 |
| Other values (2) | 1130 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 132241 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | D |
| 3rd row | C |
| 4th row | C |
| 5th row | E |
Common Values
| Value | Count | Frequency (%) |
| D | 49010 | |
| C | 41672 | |
| E | 18568 | 14.0% |
| B | 18348 | 13.9% |
| F | 3513 | 2.7% |
| G | 770 | 0.6% |
| A | 360 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| d | 49010 | |
| c | 41672 | |
| e | 18568 | 14.0% |
| b | 18348 | 13.9% |
| f | 3513 | 2.7% |
| g | 770 | 0.6% |
| a | 360 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 49010 | |
| C | 41672 | |
| E | 18568 | 14.0% |
| B | 18348 | 13.9% |
| F | 3513 | 2.7% |
| G | 770 | 0.6% |
| A | 360 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 132241 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 49010 | |
| C | 41672 | |
| E | 18568 | 14.0% |
| B | 18348 | 13.9% |
| F | 3513 | 2.7% |
| G | 770 | 0.6% |
| A | 360 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 132241 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| D | 49010 | |
| C | 41672 | |
| E | 18568 | 14.0% |
| B | 18348 | 13.9% |
| F | 3513 | 2.7% |
| G | 770 | 0.6% |
| A | 360 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 132241 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| D | 49010 | |
| C | 41672 | |
| E | 18568 | 14.0% |
| B | 18348 | 13.9% |
| F | 3513 | 2.7% |
| G | 770 | 0.6% |
| A | 360 | 0.3% |
PROPERTY_TYPE
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| Flat | |
|---|---|
| House | |
| Maisonette | 6258 |
| Bungalow | 1995 |
| Park home | 29 |
Length
| Max length | 10 |
|---|---|
| Median length | 4 |
| Mean length | 4.7125324 |
| Min length | 4 |
Characters and Unicode
| Total characters | 623190 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | House |
|---|---|
| 2nd row | House |
| 3rd row | Flat |
| 4th row | Flat |
| 5th row | Flat |
Common Values
| Value | Count | Frequency (%) |
| Flat | 75406 | |
| House | 48553 | |
| Maisonette | 6258 | 4.7% |
| Bungalow | 1995 | 1.5% |
| Park home | 29 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| flat | 75406 | |
| house | 48553 | |
| maisonette | 6258 | 4.7% |
| bungalow | 1995 | 1.5% |
| park | 29 | < 0.1% |
| home | 29 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 87922 | |
| a | 83688 | |
| l | 77401 | |
| F | 75406 | |
| e | 61098 | |
| o | 56835 | |
| s | 54811 | |
| u | 50548 | |
| H | 48553 | |
| n | 8253 | 1.3% |
| Other values (11) | 18675 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 490920 | |
| Uppercase Letter | 132241 | 21.2% |
| Space Separator | 29 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 87922 | |
| a | 83688 | |
| l | 77401 | |
| e | 61098 | |
| o | 56835 | |
| s | 54811 | |
| u | 50548 | |
| n | 8253 | 1.7% |
| i | 6258 | 1.3% |
| g | 1995 | 0.4% |
| Other values (5) | 2111 | 0.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 75406 | |
| H | 48553 | |
| M | 6258 | 4.7% |
| B | 1995 | 1.5% |
| P | 29 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 29 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 623161 | |
| Common | 29 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 87922 | |
| a | 83688 | |
| l | 77401 | |
| F | 75406 | |
| e | 61098 | |
| o | 56835 | |
| s | 54811 | |
| u | 50548 | |
| H | 48553 | |
| n | 8253 | 1.3% |
| Other values (10) | 18646 | 3.0% |
Common
| Value | Count | Frequency (%) |
| 29 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 623190 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 87922 | |
| a | 83688 | |
| l | 77401 | |
| F | 75406 | |
| e | 61098 | |
| o | 56835 | |
| s | 54811 | |
| u | 50548 | |
| H | 48553 | |
| n | 8253 | 1.3% |
| Other values (11) | 18675 | 3.0% |
BUILT_FORM
Categorical
MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5746 |
| Missing (%) | 4.3% |
| Memory size | 6.0 MiB |
| Semi-Detached | |
|---|---|
| Mid-Terrace | |
| Detached | |
| End-Terrace | |
| Enclosed End-Terrace | 3655 |
Length
| Max length | 20 |
|---|---|
| Median length | 13 |
| Mean length | 11.590838 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1466183 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Semi-Detached |
|---|---|
| 2nd row | End-Terrace |
| 3rd row | Mid-Terrace |
| 4th row | Semi-Detached |
| 5th row | Semi-Detached |
Common Values
| Value | Count | Frequency (%) |
| Semi-Detached | 42022 | |
| Mid-Terrace | 35832 | |
| Detached | 22143 | |
| End-Terrace | 20151 | |
| Enclosed End-Terrace | 3655 | 2.8% |
| Enclosed Mid-Terrace | 2692 | 2.0% |
| (Missing) | 5746 | 4.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| semi-detached | 42022 | |
| mid-terrace | 38524 | |
| end-terrace | 23806 | |
| detached | 22143 | |
| enclosed | 6347 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 301359 | |
| d | 132842 | |
| c | 132842 | |
| a | 126495 | |
| r | 124660 | |
| - | 104352 | 7.1% |
| i | 80546 | 5.5% |
| D | 64165 | 4.4% |
| t | 64165 | 4.4% |
| h | 64165 | 4.4% |
| Other values (10) | 270592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1118290 | |
| Uppercase Letter | 237194 | 16.2% |
| Dash Punctuation | 104352 | 7.1% |
| Space Separator | 6347 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 301359 | |
| d | 132842 | |
| c | 132842 | |
| a | 126495 | |
| r | 124660 | |
| i | 80546 | 7.2% |
| t | 64165 | 5.7% |
| h | 64165 | 5.7% |
| m | 42022 | 3.8% |
| n | 30153 | 2.7% |
| Other values (3) | 19041 | 1.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 64165 | |
| T | 62330 | |
| S | 42022 | |
| M | 38524 | |
| E | 30153 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 104352 |
Space Separator
| Value | Count | Frequency (%) |
| 6347 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1355484 | |
| Common | 110699 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 301359 | |
| d | 132842 | |
| c | 132842 | |
| a | 126495 | |
| r | 124660 | |
| i | 80546 | 5.9% |
| D | 64165 | 4.7% |
| t | 64165 | 4.7% |
| h | 64165 | 4.7% |
| T | 62330 | 4.6% |
| Other values (8) | 201915 |
Common
| Value | Count | Frequency (%) |
| - | 104352 | |
| 6347 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1466183 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 301359 | |
| d | 132842 | |
| c | 132842 | |
| a | 126495 | |
| r | 124660 | |
| - | 104352 | 7.1% |
| i | 80546 | 5.5% |
| D | 64165 | 4.4% |
| t | 64165 | 4.4% |
| h | 64165 | 4.4% |
| Other values (10) | 270592 |
TOTAL_FLOOR_AREA
Real number (ℝ)
| Distinct | 12020 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84.44702 |
| Minimum | 0 |
|---|---|
| Maximum | 3438 |
| Zeros | 305 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 52.25 |
| median | 71 |
| Q3 | 98 |
| 95-th percentile | 180 |
| Maximum | 3438 |
| Range | 3438 |
| Interquartile range (IQR) | 45.75 |
Descriptive statistics
| Standard deviation | 63.545944 |
|---|---|
| Coefficient of variation (CV) | 0.7524948 |
| Kurtosis | 181.03443 |
| Mean | 84.44702 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 8.0384356 |
| Sum | 11167358 |
| Variance | 4038.0869 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 1693 | 1.3% |
| 72 | 1627 | 1.2% |
| 64 | 1582 | 1.2% |
| 60 | 1572 | 1.2% |
| 70 | 1571 | 1.2% |
| 51 | 1527 | 1.2% |
| 65 | 1496 | 1.1% |
| 63 | 1489 | 1.1% |
| 66 | 1479 | 1.1% |
| 61 | 1478 | 1.1% |
| Other values (12010) | 116727 |
| Value | Count | Frequency (%) |
| 0 | 305 | |
| 0.1 | 2 | < 0.1% |
| 0.88 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2.12 | 1 | < 0.1% |
| 2.3 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 3.2 | 1 | < 0.1% |
| 3.87 | 1 | < 0.1% |
| 4.17 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3438 | 1 | |
| 2363 | 1 | |
| 2356 | 1 | |
| 2350.32 | 1 | |
| 1861.72 | 1 | |
| 1841 | 1 | |
| 1829 | 1 | |
| 1801 | 1 | |
| 1769 | 1 | |
| 1672 | 1 |
MAINS_GAS_FLAG
Categorical
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18965 |
| Missing (%) | 14.3% |
| Memory size | 6.0 MiB |
| 1.0 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 339828 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 97000 | |
| 0.0 | 16276 | 12.3% |
| (Missing) | 18965 | 14.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 97000 | |
| 0.0 | 16276 | 14.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 129552 | |
| . | 113276 | |
| 1 | 97000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 226552 | |
| Other Punctuation | 113276 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 129552 | |
| 1 | 97000 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 113276 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 339828 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 129552 | |
| . | 113276 | |
| 1 | 97000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 339828 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 129552 | |
| . | 113276 | |
| 1 | 97000 |
FLAT_TOP_STOREY
Categorical
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 60385 |
| Missing (%) | 45.7% |
| Memory size | 6.0 MiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 215568 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 44109 | |
| 1.0 | 27747 | |
| (Missing) | 60385 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 44109 | |
| 1.0 | 27747 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 115965 | |
| . | 71856 | |
| 1 | 27747 | 12.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 143712 | |
| Other Punctuation | 71856 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 115965 | |
| 1 | 27747 | 19.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 71856 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 215568 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 115965 | |
| . | 71856 | |
| 1 | 27747 | 12.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 215568 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 115965 | |
| . | 71856 | |
| 1 | 27747 | 12.9% |
FLAT_STOREY_COUNT
Real number (ℝ)
MISSING 
| Distinct | 22 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 118657 |
| Missing (%) | 89.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2533127 |
| Minimum | 0 |
|---|---|
| Maximum | 33 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 33 |
| Range | 33 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.8183614 |
|---|---|
| Coefficient of variation (CV) | 0.55892611 |
| Kurtosis | 29.566848 |
| Mean | 3.2533127 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.3666021 |
| Sum | 44193 |
| Variance | 3.3064383 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 5972 | 4.5% |
| 2 | 4079 | 3.1% |
| 4 | 2090 | 1.6% |
| 5 | 557 | 0.4% |
| 6 | 255 | 0.2% |
| 1 | 122 | 0.1% |
| 7 | 102 | 0.1% |
| 8 | 98 | 0.1% |
| 11 | 80 | 0.1% |
| 10 | 51 | < 0.1% |
| Other values (12) | 178 | 0.1% |
| (Missing) | 118657 |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1 | 122 | 0.1% |
| 2 | 4079 | |
| 3 | 5972 | |
| 4 | 2090 | 1.6% |
| 5 | 557 | 0.4% |
| 6 | 255 | 0.2% |
| 7 | 102 | 0.1% |
| 8 | 98 | 0.1% |
| 9 | 42 | < 0.1% |
| Value | Count | Frequency (%) |
| 33 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 18 | 10 | < 0.1% |
| 17 | 14 | < 0.1% |
| 16 | 30 | |
| 15 | 39 | |
| 14 | 14 | < 0.1% |
| 13 | 6 | < 0.1% |
| 12 | 18 |
MULTI_GLAZE_PROPORTION
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 101 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 13523 |
| Missing (%) | 10.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 87.491341 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 9411 |
| Zeros (%) | 7.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 100 |
| median | 100 |
| Q3 | 100 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 30.018473 |
|---|---|
| Coefficient of variation (CV) | 0.34310221 |
| Kurtosis | 3.5692313 |
| Mean | 87.491341 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.2832524 |
| Sum | 10386797 |
| Variance | 901.10871 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 94416 | |
| 0 | 9411 | 7.1% |
| 90 | 1963 | 1.5% |
| 95 | 1609 | 1.2% |
| 50 | 1395 | 1.1% |
| 80 | 1143 | 0.9% |
| 60 | 693 | 0.5% |
| 85 | 677 | 0.5% |
| 40 | 627 | 0.5% |
| 75 | 607 | 0.5% |
| Other values (91) | 6177 | 4.7% |
| (Missing) | 13523 | 10.2% |
| Value | Count | Frequency (%) |
| 0 | 9411 | |
| 1 | 7 | < 0.1% |
| 2 | 5 | < 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 174 | 0.1% |
| 6 | 16 | < 0.1% |
| 7 | 10 | < 0.1% |
| 8 | 20 | < 0.1% |
| 9 | 13 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 94416 | |
| 99 | 53 | < 0.1% |
| 98 | 187 | 0.1% |
| 97 | 73 | 0.1% |
| 96 | 83 | 0.1% |
| 95 | 1609 | 1.2% |
| 94 | 61 | < 0.1% |
| 93 | 49 | < 0.1% |
| 92 | 121 | 0.1% |
| 91 | 57 | < 0.1% |
EXTENSION_COUNT
Categorical
IMBALANCE  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18714 |
| Missing (%) | 14.2% |
| Memory size | 6.0 MiB |
| 0.0 | |
|---|---|
| 1.0 | |
| 2.0 | 5131 |
| 3.0 | 873 |
| 4.0 | 223 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 340581 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 85954 | |
| 1.0 | 21346 | 16.1% |
| 2.0 | 5131 | 3.9% |
| 3.0 | 873 | 0.7% |
| 4.0 | 223 | 0.2% |
| (Missing) | 18714 | 14.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 85954 | |
| 1.0 | 21346 | 18.8% |
| 2.0 | 5131 | 4.5% |
| 3.0 | 873 | 0.8% |
| 4.0 | 223 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 199481 | |
| . | 113527 | |
| 1 | 21346 | 6.3% |
| 2 | 5131 | 1.5% |
| 3 | 873 | 0.3% |
| 4 | 223 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 227054 | |
| Other Punctuation | 113527 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 199481 | |
| 1 | 21346 | 9.4% |
| 2 | 5131 | 2.3% |
| 3 | 873 | 0.4% |
| 4 | 223 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 113527 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 340581 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 199481 | |
| . | 113527 | |
| 1 | 21346 | 6.3% |
| 2 | 5131 | 1.5% |
| 3 | 873 | 0.3% |
| 4 | 223 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 340581 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 199481 | |
| . | 113527 | |
| 1 | 21346 | 6.3% |
| 2 | 5131 | 1.5% |
| 3 | 873 | 0.3% |
| 4 | 223 | 0.1% |
NUMBER_HABITABLE_ROOMS
Real number (ℝ)
MISSING 
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18714 |
| Missing (%) | 14.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.8495776 |
| Minimum | 1 |
|---|---|
| Maximum | 61 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 7 |
| Maximum | 61 |
| Range | 60 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.9323382 |
|---|---|
| Coefficient of variation (CV) | 0.50196109 |
| Kurtosis | 16.917343 |
| Mean | 3.8495776 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.7765379 |
| Sum | 437031 |
| Variance | 3.7339309 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 34767 | |
| 4 | 19001 | |
| 2 | 17939 | |
| 5 | 15736 | |
| 6 | 8490 | 6.4% |
| 1 | 7186 | 5.4% |
| 7 | 5023 | 3.8% |
| 8 | 2674 | 2.0% |
| 9 | 1367 | 1.0% |
| 10 | 694 | 0.5% |
| Other values (20) | 650 | 0.5% |
| (Missing) | 18714 |
| Value | Count | Frequency (%) |
| 1 | 7186 | 5.4% |
| 2 | 17939 | |
| 3 | 34767 | |
| 4 | 19001 | |
| 5 | 15736 | |
| 6 | 8490 | 6.4% |
| 7 | 5023 | 3.8% |
| 8 | 2674 | 2.0% |
| 9 | 1367 | 1.0% |
| 10 | 694 | 0.5% |
| Value | Count | Frequency (%) |
| 61 | 1 | < 0.1% |
| 54 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 25 | 2 | |
| 24 | 2 | |
| 23 | 3 | |
| 22 | 2 | |
| 21 | 3 |
NUMBER_HEATED_ROOMS
Real number (ℝ)
MISSING 
| Distinct | 28 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18714 |
| Missing (%) | 14.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.8157795 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 402 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 7 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.9222398 |
|---|---|
| Coefficient of variation (CV) | 0.5037607 |
| Kurtosis | 5.125261 |
| Mean | 3.8157795 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.3250627 |
| Sum | 433194 |
| Variance | 3.6950057 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 34650 | |
| 4 | 18919 | |
| 2 | 18056 | |
| 5 | 15452 | |
| 6 | 8402 | 6.4% |
| 1 | 7426 | 5.6% |
| 7 | 4921 | 3.7% |
| 8 | 2647 | 2.0% |
| 9 | 1342 | 1.0% |
| 10 | 680 | 0.5% |
| Other values (18) | 1032 | 0.8% |
| (Missing) | 18714 |
| Value | Count | Frequency (%) |
| 0 | 402 | 0.3% |
| 1 | 7426 | 5.6% |
| 2 | 18056 | |
| 3 | 34650 | |
| 4 | 18919 | |
| 5 | 15452 | |
| 6 | 8402 | 6.4% |
| 7 | 4921 | 3.7% |
| 8 | 2647 | 2.0% |
| 9 | 1342 | 1.0% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 25 | 2 | < 0.1% |
| 24 | 2 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 2 | < 0.1% |
| 21 | 3 | |
| 20 | 4 | |
| 19 | 4 | |
| 18 | 5 |
LOW_ENERGY_LIGHTING
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 106 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 6277 |
| Missing (%) | 4.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59.540488 |
| Minimum | 0 |
|---|---|
| Maximum | 145 |
| Zeros | 19391 |
| Zeros (%) | 14.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 20 |
| median | 68 |
| Q3 | 100 |
| 95-th percentile | 100 |
| Maximum | 145 |
| Range | 145 |
| Interquartile range (IQR) | 80 |
Descriptive statistics
| Standard deviation | 39.444433 |
|---|---|
| Coefficient of variation (CV) | 0.66248086 |
| Kurtosis | -1.4954277 |
| Mean | 59.540488 |
| Median Absolute Deviation (MAD) | 32 |
| Skewness | -0.34016127 |
| Sum | 7499958 |
| Variance | 1555.8633 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 46624 | |
| 0 | 19391 | |
| 50 | 5097 | 3.9% |
| 80 | 2345 | 1.8% |
| 67 | 2299 | 1.7% |
| 20 | 2270 | 1.7% |
| 33 | 2209 | 1.7% |
| 75 | 2173 | 1.6% |
| 25 | 2073 | 1.6% |
| 60 | 2029 | 1.5% |
| Other values (96) | 39454 | |
| (Missing) | 6277 | 4.7% |
| Value | Count | Frequency (%) |
| 0 | 19391 | |
| 1 | 256 | 0.2% |
| 2 | 187 | 0.1% |
| 3 | 439 | 0.3% |
| 4 | 560 | 0.4% |
| 5 | 618 | 0.5% |
| 6 | 641 | 0.5% |
| 7 | 586 | 0.4% |
| 8 | 757 | 0.6% |
| 9 | 465 | 0.4% |
| Value | Count | Frequency (%) |
| 145 | 1 | < 0.1% |
| 142 | 1 | < 0.1% |
| 107 | 1 | < 0.1% |
| 103 | 1 | < 0.1% |
| 101 | 4 | < 0.1% |
| 100 | 46624 | |
| 99 | 9 | < 0.1% |
| 98 | 34 | < 0.1% |
| 97 | 68 | 0.1% |
| 96 | 88 | 0.1% |
NUMBER_OPEN_FIREPLACES
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3055 |
| Missing (%) | 2.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.097309306 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 120047 |
| Zeros (%) | 90.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.49520458 |
|---|---|
| Coefficient of variation (CV) | 5.0889746 |
| Kurtosis | 12850.273 |
| Mean | 0.097309306 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 67.185734 |
| Sum | 12571 |
| Variance | 0.24522758 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 120047 | |
| 1 | 6899 | 5.2% |
| 2 | 1606 | 1.2% |
| 3 | 350 | 0.3% |
| 4 | 186 | 0.1% |
| 5 | 55 | < 0.1% |
| 6 | 22 | < 0.1% |
| 7 | 8 | < 0.1% |
| 8 | 8 | < 0.1% |
| 9 | 2 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
| (Missing) | 3055 | 2.3% |
| Value | Count | Frequency (%) |
| 0 | 120047 | |
| 1 | 6899 | 5.2% |
| 2 | 1606 | 1.2% |
| 3 | 350 | 0.3% |
| 4 | 186 | 0.1% |
| 5 | 55 | < 0.1% |
| 6 | 22 | < 0.1% |
| 7 | 8 | < 0.1% |
| 8 | 8 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 8 | < 0.1% |
| 7 | 8 | < 0.1% |
| 6 | 22 | < 0.1% |
| 5 | 55 | < 0.1% |
| 4 | 186 | |
| 3 | 350 |
WIND_TURBINE_COUNT
Categorical
IMBALANCE  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9729 |
| Missing (%) | 7.4% |
| Memory size | 6.0 MiB |
| 0.0 | |
|---|---|
| 1.0 | 63 |
| -1.0 | 54 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.0004408 |
| Min length | 3 |
Characters and Unicode
| Total characters | 367590 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 122395 | |
| 1.0 | 63 | < 0.1% |
| -1.0 | 54 | < 0.1% |
| (Missing) | 9729 | 7.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 122395 | |
| 1.0 | 117 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 244907 | |
| . | 122512 | |
| 1 | 117 | < 0.1% |
| - | 54 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 245024 | |
| Other Punctuation | 122512 | |
| Dash Punctuation | 54 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 244907 | |
| 1 | 117 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 122512 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 54 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 367590 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 244907 | |
| . | 122512 | |
| 1 | 117 | < 0.1% |
| - | 54 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 367590 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 244907 | |
| . | 122512 | |
| 1 | 117 | < 0.1% |
| - | 54 | < 0.1% |
FLOOR_HEIGHT
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 668 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 63153 |
| Missing (%) | 47.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.4876789 |
| Minimum | 0 |
|---|---|
| Maximum | 37.72 |
| Zeros | 29 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.28 |
| Q1 | 2.38 |
| median | 2.44 |
| Q3 | 2.58 |
| 95-th percentile | 2.82 |
| Maximum | 37.72 |
| Range | 37.72 |
| Interquartile range (IQR) | 0.2 |
Descriptive statistics
| Standard deviation | 0.35592388 |
|---|---|
| Coefficient of variation (CV) | 0.14307468 |
| Kurtosis | 2972.8762 |
| Mean | 2.4876789 |
| Median Absolute Deviation (MAD) | 0.09 |
| Skewness | 41.264703 |
| Sum | 171868.76 |
| Variance | 0.12668181 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.4 | 8562 | 6.5% |
| 2.5 | 6302 | 4.8% |
| 2.3 | 3994 | 3.0% |
| 2.6 | 3009 | 2.3% |
| 2.41 | 2793 | 2.1% |
| 2.7 | 1927 | 1.5% |
| 2.45 | 1804 | 1.4% |
| 2.35 | 1583 | 1.2% |
| 2.42 | 1557 | 1.2% |
| 2.43 | 1513 | 1.1% |
| Other values (658) | 36044 | |
| (Missing) | 63153 |
| Value | Count | Frequency (%) |
| 0 | 29 | < 0.1% |
| 0.1 | 2 | < 0.1% |
| 0.23 | 1 | < 0.1% |
| 1 | 202 | |
| 1.5 | 29 | < 0.1% |
| 1.51 | 1 | < 0.1% |
| 1.58 | 1 | < 0.1% |
| 1.59 | 1 | < 0.1% |
| 1.6 | 3 | < 0.1% |
| 1.63 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 37.72 | 1 | |
| 28.35 | 1 | |
| 25 | 1 | |
| 24.18 | 1 | |
| 24 | 1 | |
| 23.7 | 1 | |
| 21.91 | 1 | |
| 19.95 | 1 | |
| 19.42 | 1 | |
| 14.81 | 1 |
PHOTO_SUPPLY
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 53380 |
| Missing (%) | 40.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.062857433 |
| Minimum | 0 |
|---|---|
| Maximum | 80 |
| Zeros | 78723 |
| Zeros (%) | 59.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 80 |
| Range | 80 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.6343984 |
|---|---|
| Coefficient of variation (CV) | 26.001673 |
| Kurtosis | 975.91442 |
| Mean | 0.062857433 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 29.647367 |
| Sum | 4957 |
| Variance | 2.6712581 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 78723 | |
| 40 | 25 | < 0.1% |
| 20 | 22 | < 0.1% |
| 50 | 20 | < 0.1% |
| 35 | 15 | < 0.1% |
| 25 | 13 | < 0.1% |
| 30 | 9 | < 0.1% |
| 70 | 7 | < 0.1% |
| 33 | 5 | < 0.1% |
| 45 | 5 | < 0.1% |
| Other values (9) | 17 | < 0.1% |
| (Missing) | 53380 |
| Value | Count | Frequency (%) |
| 0 | 78723 | |
| 5 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 3 | < 0.1% |
| 15 | 4 | < 0.1% |
| 20 | 22 | < 0.1% |
| 25 | 13 | < 0.1% |
| 26 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 30 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 80 | 1 | < 0.1% |
| 75 | 2 | < 0.1% |
| 70 | 7 | < 0.1% |
| 60 | 2 | < 0.1% |
| 50 | 20 | |
| 45 | 5 | < 0.1% |
| 40 | 25 | |
| 35 | 15 | |
| 33 | 5 | < 0.1% |
| 30 | 9 | < 0.1% |
SOLAR_WATER_HEATING_FLAG
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 44350 |
| Missing (%) | 33.5% |
| Memory size | 6.0 MiB |
| 0.0 | |
|---|---|
| 1.0 | 196 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 263673 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 87695 | |
| 1.0 | 196 | 0.1% |
| (Missing) | 44350 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 87695 | |
| 1.0 | 196 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 175586 | |
| . | 87891 | |
| 1 | 196 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 175782 | |
| Other Punctuation | 87891 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 175586 | |
| 1 | 196 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 87891 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 263673 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 175586 | |
| . | 87891 | |
| 1 | 196 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 263673 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 175586 | |
| . | 87891 | |
| 1 | 196 | 0.1% |
CONSTRUCTION_AGE_BAND
Categorical
MISSING 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 14876 |
| Missing (%) | 11.2% |
| Memory size | 6.0 MiB |
| England and Wales: 1930-1949 | |
|---|---|
| England and Wales: 1900-1929 | |
| England and Wales: 1950-1966 | |
| England and Wales: 1967-1975 | |
| England and Wales: 1983-1990 | |
| Other values (8) |
Length
| Max length | 31 |
|---|---|
| Median length | 28 |
| Mean length | 28.271657 |
| Min length | 28 |
Characters and Unicode
| Total characters | 3318103 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | England and Wales: 1900-1929 |
|---|---|
| 2nd row | England and Wales: 1900-1929 |
| 3rd row | England and Wales: 1900-1929 |
| 4th row | England and Wales: 2003-2006 |
| 5th row | England and Wales: 1930-1949 |
Common Values
| Value | Count | Frequency (%) |
| England and Wales: 1930-1949 | 34178 | |
| England and Wales: 1900-1929 | 24140 | |
| England and Wales: 1950-1966 | 12746 | 9.6% |
| England and Wales: 1967-1975 | 9472 | 7.2% |
| England and Wales: 1983-1990 | 7214 | 5.5% |
| England and Wales: 2012 onwards | 5506 | 4.2% |
| England and Wales: 1976-1982 | 4571 | 3.5% |
| England and Wales: before 1900 | 4540 | 3.4% |
| England and Wales: 1996-2002 | 4166 | 3.2% |
| England and Wales: 2003-2006 | 3812 | 2.9% |
| Other values (3) | 7020 | 5.3% |
| (Missing) | 14876 |
Length
| Value | Count | Frequency (%) |
| england | 117365 | |
| and | 117365 | |
| wales | 117365 | |
| 1930-1949 | 34178 | 7.1% |
| 1900-1929 | 24140 | 5.0% |
| 1950-1966 | 12746 | 2.6% |
| 1967-1975 | 9472 | 2.0% |
| onwards | 7601 | 1.6% |
| 1983-1990 | 7214 | 1.5% |
| 2012 | 5506 | 1.1% |
| Other values (8) | 28649 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 364236 | ||
| a | 359696 | |
| n | 359696 | |
| 9 | 278198 | 8.4% |
| d | 242331 | 7.3% |
| l | 234730 | 7.1% |
| 1 | 212492 | 6.4% |
| 0 | 148185 | 4.5% |
| e | 126445 | 3.8% |
| s | 124966 | 3.8% |
| Other values (17) | 867128 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1606192 | |
| Decimal Number | 890356 | |
| Space Separator | 364236 | 11.0% |
| Uppercase Letter | 234730 | 7.1% |
| Other Punctuation | 117365 | 3.5% |
| Dash Punctuation | 105224 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 359696 | |
| n | 359696 | |
| d | 242331 | |
| l | 234730 | |
| e | 126445 | 7.9% |
| s | 124966 | 7.8% |
| g | 117365 | 7.3% |
| o | 12141 | 0.8% |
| r | 12141 | 0.8% |
| w | 7601 | 0.5% |
| Other values (2) | 9080 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 278198 | |
| 1 | 212492 | |
| 0 | 148185 | |
| 2 | 60048 | 6.7% |
| 6 | 47513 | 5.3% |
| 3 | 45204 | 5.1% |
| 4 | 34178 | 3.8% |
| 7 | 26747 | 3.0% |
| 5 | 26006 | 2.9% |
| 8 | 11785 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 117365 | |
| W | 117365 |
Space Separator
| Value | Count | Frequency (%) |
| 364236 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 117365 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 105224 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1840922 | |
| Common | 1477181 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 359696 | |
| n | 359696 | |
| d | 242331 | |
| l | 234730 | |
| e | 126445 | 6.9% |
| s | 124966 | 6.8% |
| E | 117365 | 6.4% |
| W | 117365 | 6.4% |
| g | 117365 | 6.4% |
| o | 12141 | 0.7% |
| Other values (4) | 28822 | 1.6% |
Common
| Value | Count | Frequency (%) |
| 364236 | ||
| 9 | 278198 | |
| 1 | 212492 | |
| 0 | 148185 | |
| : | 117365 | 7.9% |
| - | 105224 | 7.1% |
| 2 | 60048 | 4.1% |
| 6 | 47513 | 3.2% |
| 3 | 45204 | 3.1% |
| 4 | 34178 | 2.3% |
| Other values (3) | 64538 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3318103 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 364236 | ||
| a | 359696 | |
| n | 359696 | |
| 9 | 278198 | 8.4% |
| d | 242331 | 7.3% |
| l | 234730 | 7.1% |
| 1 | 212492 | 6.4% |
| 0 | 148185 | 4.5% |
| e | 126445 | 3.8% |
| s | 124966 | 3.8% |
| Other values (17) | 867128 |
FIXED_LIGHTING_OUTLETS_COUNT
Real number (ℝ)
MISSING 
| Distinct | 158 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 57395 |
| Missing (%) | 43.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.9839 |
| Minimum | 0 |
|---|---|
| Maximum | 965 |
| Zeros | 747 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 9 |
| Q3 | 13 |
| 95-th percentile | 30 |
| Maximum | 965 |
| Range | 965 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 12.596159 |
|---|---|
| Coefficient of variation (CV) | 1.0510901 |
| Kurtosis | 666.92946 |
| Mean | 11.9839 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 14.001332 |
| Sum | 896947 |
| Variance | 158.66323 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 9151 | 6.9% |
| 8 | 7315 | 5.5% |
| 6 | 7171 | 5.4% |
| 7 | 6239 | 4.7% |
| 12 | 5263 | 4.0% |
| 9 | 4744 | 3.6% |
| 5 | 4254 | 3.2% |
| 1 | 3225 | 2.4% |
| 11 | 2795 | 2.1% |
| 4 | 2619 | 2.0% |
| Other values (148) | 22070 | 16.7% |
| (Missing) | 57395 |
| Value | Count | Frequency (%) |
| 0 | 747 | 0.6% |
| 1 | 3225 | |
| 2 | 504 | 0.4% |
| 3 | 765 | 0.6% |
| 4 | 2619 | 2.0% |
| 5 | 4254 | |
| 6 | 7171 | |
| 7 | 6239 | |
| 8 | 7315 | |
| 9 | 4744 |
| Value | Count | Frequency (%) |
| 965 | 1 | |
| 667 | 1 | |
| 558 | 1 | |
| 476 | 1 | |
| 471 | 1 | |
| 320 | 1 | |
| 300 | 1 | |
| 258 | 1 | |
| 215 | 1 | |
| 205 | 1 |
LOW_ENERGY_FIXED_LIGHT_COUNT
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 125 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 89802 |
| Missing (%) | 67.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.0554207 |
| Minimum | 0 |
|---|---|
| Maximum | 942 |
| Zeros | 6491 |
| Zeros (%) | 4.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 5 |
| Q3 | 9 |
| 95-th percentile | 20 |
| Maximum | 942 |
| Range | 942 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 11.53721 |
|---|---|
| Coefficient of variation (CV) | 1.6352263 |
| Kurtosis | 1269.7291 |
| Mean | 7.0554207 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 21.204316 |
| Sum | 299425 |
| Variance | 133.10721 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6491 | 4.9% |
| 1 | 4349 | 3.3% |
| 4 | 3355 | 2.5% |
| 6 | 3195 | 2.4% |
| 3 | 3168 | 2.4% |
| 5 | 3014 | 2.3% |
| 2 | 3010 | 2.3% |
| 10 | 2983 | 2.3% |
| 8 | 2148 | 1.6% |
| 7 | 2045 | 1.5% |
| Other values (115) | 8681 | 6.6% |
| (Missing) | 89802 |
| Value | Count | Frequency (%) |
| 0 | 6491 | |
| 1 | 4349 | |
| 2 | 3010 | |
| 3 | 3168 | |
| 4 | 3355 | |
| 5 | 3014 | |
| 6 | 3195 | |
| 7 | 2045 | 1.5% |
| 8 | 2148 | 1.6% |
| 9 | 1316 | 1.0% |
| Value | Count | Frequency (%) |
| 942 | 1 | < 0.1% |
| 577 | 1 | < 0.1% |
| 469 | 1 | < 0.1% |
| 320 | 1 | < 0.1% |
| 300 | 1 | < 0.1% |
| 258 | 1 | < 0.1% |
| 200 | 3 | |
| 180 | 1 | < 0.1% |
| 169 | 2 | |
| 166 | 1 | < 0.1% |
WALL_TYPE
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| Solid brick | |
|---|---|
| Cavity wall | |
| Other | |
| System built | 5171 |
| Timber frame | 4019 |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.219939 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1351495 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Solid brick |
|---|---|
| 2nd row | Solid brick |
| 3rd row | Timber frame |
| 4th row | Solid brick |
| 5th row | Solid brick |
Common Values
| Value | Count | Frequency (%) |
| Solid brick | 56524 | |
| Cavity wall | 47804 | |
| Other | 18719 | 14.2% |
| System built | 5171 | 3.9% |
| Timber frame | 4019 | 3.0% |
| Cob | 4 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| solid | 56524 | |
| brick | 56524 | |
| cavity | 47804 | |
| wall | 47804 | |
| other | 18719 | 7.6% |
| system | 5171 | 2.1% |
| built | 5171 | 2.1% |
| timber | 4019 | 1.6% |
| frame | 4019 | 1.6% |
| cob | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 170042 | |
| l | 157303 | 11.6% |
| 113518 | 8.4% | |
| a | 99627 | 7.4% |
| r | 83281 | 6.2% |
| t | 76865 | 5.7% |
| b | 65718 | 4.9% |
| S | 61695 | 4.6% |
| o | 56528 | 4.2% |
| d | 56524 | 4.2% |
| Other values (14) | 410394 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1105736 | |
| Uppercase Letter | 132241 | 9.8% |
| Space Separator | 113518 | 8.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 170042 | |
| l | 157303 | |
| a | 99627 | |
| r | 83281 | 7.5% |
| t | 76865 | 7.0% |
| b | 65718 | 5.9% |
| o | 56528 | 5.1% |
| d | 56524 | 5.1% |
| c | 56524 | 5.1% |
| k | 56524 | 5.1% |
| Other values (9) | 226800 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 61695 | |
| C | 47808 | |
| O | 18719 | 14.2% |
| T | 4019 | 3.0% |
Space Separator
| Value | Count | Frequency (%) |
| 113518 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1237977 | |
| Common | 113518 | 8.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 170042 | |
| l | 157303 | |
| a | 99627 | 8.0% |
| r | 83281 | 6.7% |
| t | 76865 | 6.2% |
| b | 65718 | 5.3% |
| S | 61695 | 5.0% |
| o | 56528 | 4.6% |
| d | 56524 | 4.6% |
| c | 56524 | 4.6% |
| Other values (13) | 353870 |
Common
| Value | Count | Frequency (%) |
| 113518 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1351495 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 170042 | |
| l | 157303 | 11.6% |
| 113518 | 8.4% | |
| a | 99627 | 7.4% |
| r | 83281 | 6.2% |
| t | 76865 | 5.7% |
| b | 65718 | 4.9% |
| S | 61695 | 4.6% |
| o | 56528 | 4.2% |
| d | 56524 | 4.2% |
| Other values (14) | 410394 |
WALL_INSULATION
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 132241 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 103025 | |
| 1 | 29216 | 22.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 103025 | |
| 1 | 29216 | 22.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 103025 | |
| 1 | 29216 | 22.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 132241 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 103025 | |
| 1 | 29216 | 22.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 132241 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 103025 | |
| 1 | 29216 | 22.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 132241 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 103025 | |
| 1 | 29216 | 22.1% |
FLOOR_TYPE
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| Other property below | |
|---|---|
| Suspended | |
| Solid | |
| Other |
Length
| Max length | 20 |
|---|---|
| Median length | 9 |
| Mean length | 12.387656 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1638156 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Solid |
|---|---|
| 2nd row | Suspended |
| 3rd row | Other property below |
| 4th row | Other property below |
| 5th row | Suspended |
Common Values
| Value | Count | Frequency (%) |
| Other property below | 54853 | |
| Suspended | 38539 | |
| Solid | 30433 | |
| Other | 8416 | 6.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| other | 63269 | |
| property | 54853 | |
| below | 54853 | |
| suspended | 38539 | |
| solid | 30433 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 250053 | |
| r | 172975 | |
| p | 148245 | 9.0% |
| o | 140139 | 8.6% |
| t | 118122 | 7.2% |
| 109706 | 6.7% | |
| d | 107511 | 6.6% |
| l | 85286 | 5.2% |
| S | 68972 | 4.2% |
| O | 63269 | 3.9% |
| Other values (8) | 373878 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1396209 | |
| Uppercase Letter | 132241 | 8.1% |
| Space Separator | 109706 | 6.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 250053 | |
| r | 172975 | |
| p | 148245 | |
| o | 140139 | |
| t | 118122 | |
| d | 107511 | |
| l | 85286 | 6.1% |
| h | 63269 | 4.5% |
| b | 54853 | 3.9% |
| w | 54853 | 3.9% |
| Other values (5) | 200903 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 68972 | |
| O | 63269 |
Space Separator
| Value | Count | Frequency (%) |
| 109706 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1528450 | |
| Common | 109706 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 250053 | |
| r | 172975 | |
| p | 148245 | |
| o | 140139 | |
| t | 118122 | 7.7% |
| d | 107511 | 7.0% |
| l | 85286 | 5.6% |
| S | 68972 | 4.5% |
| O | 63269 | 4.1% |
| h | 63269 | 4.1% |
| Other values (7) | 310609 |
Common
| Value | Count | Frequency (%) |
| 109706 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1638156 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 250053 | |
| r | 172975 | |
| p | 148245 | 9.0% |
| o | 140139 | 8.6% |
| t | 118122 | 7.2% |
| 109706 | 6.7% | |
| d | 107511 | 6.6% |
| l | 85286 | 5.2% |
| S | 68972 | 4.2% |
| O | 63269 | 3.9% |
| Other values (8) | 373878 |
FLOOR_INSULATION
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 132241 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 71255 | |
| 1 | 60986 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 71255 | |
| 1 | 60986 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 71255 | |
| 1 | 60986 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 132241 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 71255 | |
| 1 | 60986 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 132241 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 71255 | |
| 1 | 60986 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 132241 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 71255 | |
| 1 | 60986 |
ROOF_TYPE
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| Pitched | |
|---|---|
| Other property above | |
| Other | |
| Flat |
Length
| Max length | 20 |
|---|---|
| Median length | 7 |
| Mean length | 11.528928 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1524597 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Pitched |
|---|---|
| 2nd row | Pitched |
| 3rd row | Pitched |
| 4th row | Pitched |
| 5th row | Other property above |
Common Values
| Value | Count | Frequency (%) |
| Pitched | 63753 | |
| Other property above | 49603 | |
| Other | 10726 | 8.1% |
| Flat | 8159 | 6.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| pitched | 63753 | |
| other | 60329 | |
| property | 49603 | |
| above | 49603 | |
| flat | 8159 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 223288 | |
| t | 181844 | |
| r | 159535 | |
| h | 124082 | 8.1% |
| 99206 | 6.5% | |
| o | 99206 | 6.5% |
| p | 99206 | 6.5% |
| i | 63753 | 4.2% |
| P | 63753 | 4.2% |
| d | 63753 | 4.2% |
| Other values (8) | 346971 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1293150 | |
| Uppercase Letter | 132241 | 8.7% |
| Space Separator | 99206 | 6.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 223288 | |
| t | 181844 | |
| r | 159535 | |
| h | 124082 | |
| o | 99206 | |
| p | 99206 | |
| i | 63753 | 4.9% |
| d | 63753 | 4.9% |
| c | 63753 | 4.9% |
| a | 57762 | 4.5% |
| Other values (4) | 156968 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 63753 | |
| O | 60329 | |
| F | 8159 | 6.2% |
Space Separator
| Value | Count | Frequency (%) |
| 99206 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1425391 | |
| Common | 99206 | 6.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 223288 | |
| t | 181844 | |
| r | 159535 | |
| h | 124082 | |
| o | 99206 | 7.0% |
| p | 99206 | 7.0% |
| i | 63753 | 4.5% |
| P | 63753 | 4.5% |
| d | 63753 | 4.5% |
| c | 63753 | 4.5% |
| Other values (7) | 283218 |
Common
| Value | Count | Frequency (%) |
| 99206 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1524597 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 223288 | |
| t | 181844 | |
| r | 159535 | |
| h | 124082 | 8.1% |
| 99206 | 6.5% | |
| o | 99206 | 6.5% |
| p | 99206 | 6.5% |
| i | 63753 | 4.2% |
| P | 63753 | 4.2% |
| d | 63753 | 4.2% |
| Other values (8) | 346971 |
ROOF_INSULATION
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 132241 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 92704 | |
| 0 | 39537 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 92704 | |
| 0 | 39537 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 92704 | |
| 0 | 39537 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 132241 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 92704 | |
| 0 | 39537 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 132241 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 92704 | |
| 0 | 39537 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 132241 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 92704 | |
| 0 | 39537 |
MAIN_FUEL_TYPE
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| mains gas | |
|---|---|
| electricity | |
| Other | 8237 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 9.0302176 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1194165 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | mains gas |
|---|---|
| 2nd row | mains gas |
| 3rd row | mains gas |
| 4th row | mains gas |
| 5th row | mains gas |
Common Values
| Value | Count | Frequency (%) |
| mains gas | 105532 | |
| electricity | 18472 | 14.0% |
| Other | 8237 | 6.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| mains | 105532 | |
| gas | 105532 | |
| electricity | 18472 | 7.8% |
| other | 8237 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 211064 | |
| s | 211064 | |
| i | 142476 | |
| m | 105532 | |
| n | 105532 | |
| 105532 | ||
| g | 105532 | |
| e | 45181 | 3.8% |
| t | 45181 | 3.8% |
| c | 36944 | 3.1% |
| Other values (5) | 80127 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1080396 | |
| Space Separator | 105532 | 8.8% |
| Uppercase Letter | 8237 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 211064 | |
| s | 211064 | |
| i | 142476 | |
| m | 105532 | |
| n | 105532 | |
| g | 105532 | |
| e | 45181 | 4.2% |
| t | 45181 | 4.2% |
| c | 36944 | 3.4% |
| r | 26709 | 2.5% |
| Other values (3) | 45181 | 4.2% |
Space Separator
| Value | Count | Frequency (%) |
| 105532 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 8237 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1088633 | |
| Common | 105532 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 211064 | |
| s | 211064 | |
| i | 142476 | |
| m | 105532 | |
| n | 105532 | |
| g | 105532 | |
| e | 45181 | 4.2% |
| t | 45181 | 4.2% |
| c | 36944 | 3.4% |
| r | 26709 | 2.5% |
| Other values (4) | 53418 | 4.9% |
Common
| Value | Count | Frequency (%) |
| 105532 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1194165 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 211064 | |
| s | 211064 | |
| i | 142476 | |
| m | 105532 | |
| n | 105532 | |
| 105532 | ||
| g | 105532 | |
| e | 45181 | 3.8% |
| t | 45181 | 3.8% |
| c | 36944 | 3.1% |
| Other values (5) | 80127 | 6.7% |
| CURRENT_ENERGY_RATING | PROPERTY_TYPE | BUILT_FORM | TOTAL_FLOOR_AREA | MAINS_GAS_FLAG | FLAT_TOP_STOREY | FLAT_STOREY_COUNT | MULTI_GLAZE_PROPORTION | EXTENSION_COUNT | NUMBER_HABITABLE_ROOMS | NUMBER_HEATED_ROOMS | LOW_ENERGY_LIGHTING | NUMBER_OPEN_FIREPLACES | WIND_TURBINE_COUNT | FLOOR_HEIGHT | PHOTO_SUPPLY | SOLAR_WATER_HEATING_FLAG | CONSTRUCTION_AGE_BAND | FIXED_LIGHTING_OUTLETS_COUNT | LOW_ENERGY_FIXED_LIGHT_COUNT | WALL_TYPE | WALL_INSULATION | FLOOR_TYPE | FLOOR_INSULATION | ROOF_TYPE | ROOF_INSULATION | MAIN_FUEL_TYPE | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 22596 | C | House | Semi-Detached | 171.00 | 1.0 | NaN | NaN | 100.0 | 2.0 | 7.0 | 7.0 | 100.0 | 0.0 | 0.0 | NaN | 0.0 | NaN | England and Wales: 1900-1929 | 24.0 | 24.0 | Solid brick | 0 | Solid | 0 | Pitched | 0 | mains gas |
| 95416 | D | House | End-Terrace | 56.00 | 1.0 | NaN | NaN | 100.0 | 1.0 | 4.0 | 4.0 | 100.0 | 0.0 | 0.0 | NaN | NaN | 0.0 | England and Wales: 1900-1929 | NaN | NaN | Solid brick | 0 | Suspended | 0 | Pitched | 1 | mains gas |
| 118768 | C | Flat | Mid-Terrace | 24.00 | 1.0 | 1.0 | NaN | 100.0 | 0.0 | 1.0 | 1.0 | 100.0 | 0.0 | 0.0 | NaN | NaN | 0.0 | England and Wales: 1900-1929 | NaN | NaN | Timber frame | 0 | Other property below | 1 | Pitched | 1 | mains gas |
| 65619 | C | Flat | Semi-Detached | 33.08 | 1.0 | 1.0 | NaN | 100.0 | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 2.40 | 0.0 | NaN | England and Wales: 2003-2006 | 8.0 | 0.0 | Solid brick | 1 | Other property below | 1 | Pitched | 0 | mains gas |
| 19489 | E | Flat | Semi-Detached | 67.00 | 1.0 | 0.0 | NaN | 55.0 | 2.0 | 2.0 | 2.0 | 88.0 | 1.0 | 0.0 | NaN | 0.0 | NaN | England and Wales: 1930-1949 | 8.0 | 7.0 | Solid brick | 0 | Suspended | 0 | Other property above | 1 | mains gas |
| 152082 | B | Flat | Detached | 109.00 | NaN | 0.0 | NaN | 100.0 | NaN | NaN | NaN | 100.0 | 0.0 | 0.0 | 2.55 | NaN | NaN | England and Wales: 2012 onwards | 4.0 | NaN | Other | 0 | Other | 0 | Other property above | 1 | Other |
| 52692 | D | House | Mid-Terrace | 118.00 | 1.0 | NaN | NaN | 100.0 | 0.0 | 5.0 | 5.0 | 58.0 | 0.0 | 0.0 | NaN | 0.0 | NaN | England and Wales: before 1900 | 12.0 | 7.0 | Solid brick | 0 | Solid | 0 | Pitched | 1 | mains gas |
| 23920 | C | Maisonette | End-Terrace | 48.20 | 1.0 | 1.0 | NaN | 100.0 | 0.0 | 2.0 | 2.0 | 0.0 | 0.0 | 0.0 | 2.27 | 0.0 | NaN | England and Wales: 1983-1990 | 14.0 | 0.0 | Cavity wall | 0 | Other property below | 1 | Pitched | 1 | mains gas |
| 19054 | E | House | Mid-Terrace | 77.00 | 1.0 | NaN | NaN | 0.0 | 0.0 | 4.0 | 4.0 | 0.0 | 1.0 | 0.0 | NaN | 0.0 | NaN | England and Wales: before 1900 | 11.0 | 0.0 | Solid brick | 0 | Suspended | 0 | Pitched | 0 | mains gas |
| 68456 | F | House | Detached | 161.00 | 1.0 | NaN | NaN | 100.0 | 0.0 | 7.0 | 7.0 | 100.0 | 0.0 | 0.0 | NaN | NaN | 0.0 | England and Wales: 1930-1949 | NaN | NaN | Cavity wall | 0 | Suspended | 0 | Pitched | 0 | mains gas |
| CURRENT_ENERGY_RATING | PROPERTY_TYPE | BUILT_FORM | TOTAL_FLOOR_AREA | MAINS_GAS_FLAG | FLAT_TOP_STOREY | FLAT_STOREY_COUNT | MULTI_GLAZE_PROPORTION | EXTENSION_COUNT | NUMBER_HABITABLE_ROOMS | NUMBER_HEATED_ROOMS | LOW_ENERGY_LIGHTING | NUMBER_OPEN_FIREPLACES | WIND_TURBINE_COUNT | FLOOR_HEIGHT | PHOTO_SUPPLY | SOLAR_WATER_HEATING_FLAG | CONSTRUCTION_AGE_BAND | FIXED_LIGHTING_OUTLETS_COUNT | LOW_ENERGY_FIXED_LIGHT_COUNT | WALL_TYPE | WALL_INSULATION | FLOOR_TYPE | FLOOR_INSULATION | ROOF_TYPE | ROOF_INSULATION | MAIN_FUEL_TYPE | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 39970 | C | Flat | Detached | 53.68 | 1.0 | 0.0 | 3.0 | 100.0 | 0.0 | 3.0 | 3.0 | 83.0 | 0.0 | 0.0 | 2.30 | 0.0 | 0.0 | England and Wales: 1983-1990 | NaN | NaN | Cavity wall | 1 | Other property below | 1 | Other property above | 1 | mains gas |
| 3770 | B | Flat | NaN | 92.00 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | NaN | NaN | NaN | NaN | NaN | 20.0 | 20.0 | Other | 0 | Other property below | 1 | Other property above | 1 | mains gas |
| 20967 | E | Flat | Enclosed End-Terrace | 56.22 | 1.0 | 1.0 | NaN | 100.0 | 0.0 | 3.0 | 3.0 | 96.0 | 0.0 | 0.0 | 2.56 | 0.0 | NaN | England and Wales: 1930-1949 | 28.0 | 27.0 | Solid brick | 0 | Other property below | 1 | Flat | 0 | mains gas |
| 154305 | E | Bungalow | Detached | 82.00 | 1.0 | NaN | NaN | 20.0 | 1.0 | 5.0 | 5.0 | 93.0 | 0.0 | 0.0 | 2.43 | 0.0 | 0.0 | England and Wales: 1976-1982 | 14.0 | NaN | Cavity wall | 1 | Suspended | 0 | Pitched | 1 | mains gas |
| 1025 | B | House | Mid-Terrace | 127.00 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 100.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | 12.0 | 12.0 | Other | 0 | Other | 0 | Other | 0 | mains gas |
| 161582 | D | Maisonette | End-Terrace | 60.00 | 1.0 | 0.0 | NaN | 100.0 | 1.0 | 3.0 | 3.0 | 100.0 | 0.0 | 0.0 | 2.51 | 0.0 | 0.0 | England and Wales: 1900-1929 | 8.0 | NaN | Solid brick | 0 | Suspended | 0 | Other property above | 1 | mains gas |
| 66889 | D | Flat | Mid-Terrace | 39.45 | 0.0 | 0.0 | 5.0 | 0.0 | 0.0 | 2.0 | 2.0 | 0.0 | 0.0 | 0.0 | 2.29 | 0.0 | 0.0 | England and Wales: 1950-1966 | NaN | NaN | Cavity wall | 0 | Other property below | 1 | Other property above | 1 | electricity |
| 30764 | E | House | Semi-Detached | 142.00 | 1.0 | NaN | NaN | 95.0 | 1.0 | 6.0 | 6.0 | 35.0 | 2.0 | 0.0 | NaN | NaN | 0.0 | England and Wales: 1930-1949 | NaN | NaN | Solid brick | 0 | Suspended | 0 | Pitched | 0 | mains gas |
| 9485 | C | House | Mid-Terrace | 67.20 | 1.0 | NaN | NaN | 0.0 | 0.0 | 3.0 | 3.0 | 50.0 | 0.0 | 0.0 | 2.30 | 0.0 | 0.0 | England and Wales: 1996-2002 | NaN | NaN | Cavity wall | 1 | Solid | 1 | Pitched | 1 | mains gas |
| 69600 | D | Flat | Mid-Terrace | 34.00 | 1.0 | 1.0 | NaN | 100.0 | 1.0 | 2.0 | 2.0 | 50.0 | 0.0 | 0.0 | NaN | NaN | 0.0 | England and Wales: 1950-1966 | NaN | NaN | Solid brick | 0 | Other property below | 1 | Pitched | 1 | mains gas |
Most frequently occurring
| CURRENT_ENERGY_RATING | PROPERTY_TYPE | BUILT_FORM | TOTAL_FLOOR_AREA | MAINS_GAS_FLAG | FLAT_TOP_STOREY | FLAT_STOREY_COUNT | MULTI_GLAZE_PROPORTION | EXTENSION_COUNT | NUMBER_HABITABLE_ROOMS | NUMBER_HEATED_ROOMS | LOW_ENERGY_LIGHTING | NUMBER_OPEN_FIREPLACES | WIND_TURBINE_COUNT | FLOOR_HEIGHT | PHOTO_SUPPLY | SOLAR_WATER_HEATING_FLAG | CONSTRUCTION_AGE_BAND | FIXED_LIGHTING_OUTLETS_COUNT | LOW_ENERGY_FIXED_LIGHT_COUNT | WALL_TYPE | WALL_INSULATION | FLOOR_TYPE | FLOOR_INSULATION | ROOF_TYPE | ROOF_INSULATION | MAIN_FUEL_TYPE | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 227 | B | Flat | Detached | 51.0 | NaN | 0.0 | NaN | 100.0 | NaN | NaN | NaN | 100.0 | 0.0 | 0.0 | NaN | NaN | NaN | England and Wales: 2012 onwards | 1.0 | NaN | Other | 0 | Other property below | 1 | Other property above | 1 | Other | 58 |
| 182 | B | Flat | Detached | 50.0 | NaN | 0.0 | NaN | 100.0 | NaN | NaN | NaN | 100.0 | 0.0 | 0.0 | NaN | NaN | NaN | England and Wales: 2012 onwards | 1.0 | NaN | Other | 0 | Other property below | 1 | Other property above | 1 | Other | 53 |
| 4310 | C | Maisonette | Mid-Terrace | 81.0 | 1.0 | 0.0 | NaN | 0.0 | 0.0 | 3.0 | 3.0 | 20.0 | 0.0 | 0.0 | NaN | 0.0 | NaN | England and Wales: 1967-1975 | 10.0 | 2.0 | Cavity wall | 0 | Other property below | 1 | Other property above | 1 | mains gas | 52 |
| 560 | B | Flat | Detached | 71.0 | NaN | 0.0 | NaN | 100.0 | NaN | NaN | NaN | 100.0 | 0.0 | 0.0 | NaN | NaN | NaN | England and Wales: 2012 onwards | 1.0 | NaN | Other | 0 | Other property below | 1 | Other property above | 1 | Other | 40 |
| 1108 | B | Flat | End-Terrace | 69.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | NaN | NaN | NaN | NaN | NaN | 9.0 | 9.0 | Other | 0 | Other property below | 1 | Other property above | 1 | Other | 38 |
| 321 | B | Flat | Detached | 54.0 | NaN | 0.0 | NaN | 100.0 | NaN | NaN | NaN | 100.0 | 0.0 | 0.0 | NaN | NaN | NaN | England and Wales: 2012 onwards | 10.0 | NaN | Other | 0 | Other property below | 1 | Other property above | 1 | Other | 36 |
| 424 | B | Flat | Detached | 63.0 | NaN | 0.0 | NaN | 100.0 | NaN | NaN | NaN | 100.0 | 0.0 | 0.0 | NaN | NaN | NaN | England and Wales: 2012 onwards | 10.0 | NaN | Other | 0 | Other property below | 1 | Other property above | 1 | Other | 34 |
| 653 | B | Flat | Detached | 74.0 | NaN | 0.0 | NaN | 100.0 | NaN | NaN | NaN | 100.0 | 0.0 | 0.0 | NaN | NaN | NaN | England and Wales: 2012 onwards | 1.0 | NaN | Other | 0 | Other property below | 1 | Other property above | 1 | Other | 31 |
| 1105 | B | Flat | End-Terrace | 67.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | NaN | NaN | NaN | NaN | NaN | 9.0 | 9.0 | Other | 0 | Other property below | 1 | Other property above | 1 | Other | 31 |
| 1377 | B | Flat | Mid-Terrace | 66.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | NaN | NaN | NaN | NaN | NaN | 9.0 | 9.0 | Other | 0 | Other property below | 1 | Other property above | 1 | Other | 31 |